Asynchronous Multimodal Text Entry Using Speech and Gesture Keyboards
Abstract
We propose reducing errors in text entry by combining speech and gesture keyboard input. We describe a merge model that combines recognition results in an asynchronous and flexible manner. We collected speech and gesture data of users entering both short email sentences and web search queries. By merging recognition results from both modalities, word error rate was reduced by 53% relative for email sentences and 29% relative for web searches. For email utterances with speech errors, we investigated providing gesture keyboard corrections of only the erroneous words. Without the user explicitly indicating the incorrect words, our model was able to reduce the word error rate by 44% relative.
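The "relative" reductions quoted above compare the word error rate (WER) of a baseline against the WER after merging. As a minimal sketch of that arithmetic, assuming the standard word-level edit distance definition of WER (the abstract does not spell out its scoring procedure), the following computes WER and a relative reduction. The function names and the example sentences and rates are hypothetical illustrations, not data from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, merged_wer: float) -> float:
    """Fraction of the baseline error removed by the merged result."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - merged_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical recognition result: one substitution, one deletion.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
    # Hypothetical rates: a drop from 10% to 4.7% WER is a 53% relative reduction.
    print(relative_reduction(0.10, 0.047))
```

Under this definition, a relative reduction of 53% means the merged result removes about half of the baseline's errors, whatever the absolute baseline rate was.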
Related papers
Visual Error Resolution Strategy for highly-structured text entry using Speech Recognition in FP6-ALLADIN project
Man-machine interaction using only speech input is not well received by users, even with high-performance recognizers (WER of about 2%). In most free-text dictation applications, achieving the user's intention is more important than the performance of specific speech tools, and a low transaction success rate leads users to reject speech interfaces [6]. For highly-structured text entry, users will bett...
A Salience-Based Approach to Gesture-Speech Alignment
One of the first steps towards understanding natural multimodal language is aligning gesture and speech, so that the appropriate gestures ground referential pronouns in the speech. This paper presents a novel technique for gesture-speech alignment, inspired by salience-based approaches to anaphoric pronoun resolution. We use a hybrid between data-driven and knowledge-based methods: the basic str...
Speech Is 3x Faster than Typing for English and Mandarin Text Entry on Mobile Devices
With laptops and desktops, the dominant method of text entry is the full-size keyboard; now with the ubiquity of mobile devices like smartphones, two new widely used methods have emerged: miniature touch screen keyboards and speech-based dictation. It is currently unknown how these two modern methods compare. We therefore evaluated the text entry performance of both methods in English and in Ma...
Interpreting Gestures for Text Entry on Touch Screen Devices
Text entry on touch screen devices is often performed through soft keyboards. One of the latest research trends is to abandon the traditional tapping interaction in favor of more natural gesture-based interactions on these keyboards. The interpretation of the gestures is performed through sketch-based techniques. In this paper we present the sketch-based technology related to the interpretation...
Building and Testing Optimized Keyboards for Specific Text Entry
OBJECTIVE We explore how to optimally design systems for information input. BACKGROUND As computers are introduced into ever more devices with new methods of inputting information, there is a need for specialized systems that are optimally designed for their particular use. METHOD The study demonstrates how to use a model of text entry times to build optimized keyboards for specific sets of...
Publication date: 2011